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This  paper  presents  an  overview  of  existing  research  on  wind  speed  and  power  forecasting.  It  first 
discusses  state-of-the-art  wind  speed  and  power  forecasting  approaches.  Then,  forecasting  accuracy  is 
presented  based  on  variable  factors.  Finally,  potential  techniques  to  improve  the  accuracy  of  forecasting 
models  are  reviewed.  A  full  survey  on  all  existing  models  is  not  presented,  but  attempts  to  highlight  the 
most  promising  body  of  knowledge  concerning  wind  speed  and  power  forecasting. 
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1.  Introduction 

Wind  power  is  one  of  the  most  rapidly  growing  renewable 
energy  sources,  and  is  regarded  as  an  appealing  alternative  to 
conventional  power  generated  from  fossil  fuel.  This  led  to  a 
collaborative  effort  to  achieve  20%  of  U.S.  electricity  supplied  from 
wind  power  by  2030  [1].  Although  the  integration  of  wind  power 
brings  many  advantages,  high  penetration  of  wind  power  provides 
a  number  of  challenges  in  power  system  operations  and  planning, 
mainly  due  to  its  uncertain  and  intermittent  nature.  In  the 
electricity  system  the  power  supply  must  be  equal  to  the  power 
demand  at  all  times.  However,  the  variation  of  wind  power  output 
makes  it  difficult  to  maintain  this  balance. 

One  of  the  possible  solutions  to  the  balance  challenge  is  to 
improve  the  wind  speed  and  power  forecasting.  Research  in  the 
area  of  forecasting  wind  speed  or  the  power  produced  by  wind 
farms  has  been  devoted  to  the  development  of  effective  and 
reliable  tools  and  many  different  approaches  have  been  proposed 
and  reviewed  in  [2—13],  Accurate  forecasting  tools  reduce  operat¬ 
ing  costs  and  improve  reliability  associated  with  the  integration  of 
wind  power  into  the  existing  electricity  supply  system  [14-29], 

There  are  different  users  of  wind  speed  and  power  forecasts.  These 
users  not  only  need  point  forecasts  but  also  the  uncertainty  of  the 
forecast  is  essential  for  determining  the  size  of  the  operating  reserves 
necessary  to  balance  the  generation  with  load.  For  the  market 
operator,  the  forecast  of  the  total  wind  power  production  in  a  region 
is  needed  more  than  the  individual  wind  plant  forecast.  For  the  system 
planner,  it  is  essential  to  plan  the  installation  of  new  wind  turbines. 
The  desired  forecasting  capabilities  typically  vaiy  among  user  groups. 
This  paper  divides  the  forecasting  methods  into  five  categories:  wind 
speed  and  power  forecasting,  spatial  correlation  forecasting,  regional 
forecasting,  probabilistic  forecasting,  and  offshore  forecasting. 

A  large  amount  of  research  has  been  directed  toward  the 
development  of  accurate  and  reliable  wind  speed  and  power 
forecasting  models  and  many  different  approaches  have  been 
developed.  However  it  is  difficult  to  draw  conclusions  as  to  which 
model  is  the  best  because  each  model  in  use  has  significant  site 
dependencies.  Thus,  a  forecasting  model  may  perform  well  at  its 
site,  but  this  does  not  guarantee  that  the  model  will  work  well  at 
another  site.  This  paper  discusses  forecasting  accuracy  based  on 
variable  factors  that  are  used  in  the  forecast. 

Furthermore,  several  potential  techniques  for  improving  fore¬ 
casting  accuracy  have  been  reported  in  the  literature.  These  are 
also  discussed  and  reviewed  in  this  paper. 

The  paper  is  organized  as  follows.  Section  2  presents  an  overview 
of  existing  wind  speed  and  power  forecasting  approaches.  In  Section 
3,  the  general  forecasting  accuracy  is  discussed.  Some  of  the  potential 
techniques  to  improve  the  performance  of  forecasting  models  are 


presented  in  Section  4.  Finally,  conclusions  and  future  works  are 
drawn  in  Section  5. 

2.  Overview  of  current  wind  speed  and  power  forecasting 

2.3.  Wind  speed  and  power  forecasting 

The  basic  role  of  wind  speed  and  power  forecasting  is  to 
provide  information  about  the  wind  speed  and  power  that  can 
be  expected  in  the  next  few  minutes,  hours,  or  days.  Based  on 
power  system  operation  requirements,  the  forecast  can  be  divided 
into  four  different  horizons:  very  short-term  (few  seconds  to 
30  min),  short-term  (30  min  to  6  h),  medium-term  (6-24  h),  and 
long-term  (1-7  days)  [10,11],  Very  short-term  forecasts  are  used 
for  turbine  control  and  load  tracking.  Short-term  forecasts  are 
utilized  for  preload  sharing.  Medium-term  forecasts  are  used  for 
power  system  management  and  energy  trading.  Long-term  fore¬ 
cast  are  used  for  maintenance  scheduling  of  the  wind  turbines. 

Research  in  the  area  of  forecasting  wind  speed  and  the  power 
produced  by  wind  farms  has  been  devoted  to  the  development  of 
effective  and  reliable  tools  and  many  different  approaches  have 
been  proposed.  These  tools  can  be  classified  whether  the  terrain 
information  at  the  location  is  used  as  an  input  or  not.  Two 
mainstream  approaches  are  the  physical  and  the  statistical 
approach.  In  some  models  a  combination  approach  is  used  in  an 
attempt  to  integrate  the  advantages  of  both  approaches.  In  this 
section  an  overview  of  existing  wind  speed  and  power  forecast 
approaches  is  presented. 

2.3.3.  Physical  forecasting  approach 

The  physical  approach  to  forecasting,  in  contrast  to  statistical 
approach,  uses  the  detailed  physical  description  to  model  the  on¬ 
site  conditions  at  the  location  of  the  wind  farm  [7,30],  The  basic 
operation  of  a  physical  approach  is  illustrated  in  Fig.  1. 

It  carries  out  the  refinement  of  the  Numerical  Weather  Predic¬ 
tion  (NWP)  data  to  take  into  account  the  on-site  conditions  by  the 
downscaling  method,  which  are  based  on  the  physics  of  the  lower 
atmospheric  boundary  layer.  The  downscaling  method  requires 
the  detailed  physical  descriptions  of  the  wind  farms  and  their 
surroundings,  including:  description  of  the  wind  farm  (wind  farm 
layout  and  wind  turbine  power  curve,  etc.)  and  description  of  the 
terrain  (orography,  roughness,  obstacles,  etc.).  Then,  the  refined 
wind  speed  data  at  the  hub  height  of  the  wind  turbines  is  plugged 
into  the  corresponding  wind  power  curve  to  calculate  the  wind 
power  production.  If  the  on-line  data  is  available,  model  output 
statistics  are  performed  to  reduce  the  error  of  the  forecast. 
Contrary  to  the  statistical  approach,  the  physical  approach  does 
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Fig.  1.  The  physical  approach  to  forecasting  wind  speed  and  power. 


not  require  training  input  from  historical  data.  However,  acquiring 
the  physical  data  is  one  of  the  main  drawbacks  of  the  approach. 

A  number  of  physical  approaches  have  been  introduced 
[2,7,8,31,32],  The  Prediktor  is  developed  by  the  Risoe  National 
Laboratory  in  Denmark.  It  uses  Wind  Atlas  Analysis  and  Applica¬ 
tion  Program  (WAsP)  and  PARK  program  to  take  the  local  condi¬ 
tions  into  account  by  using  the  NWP  forecast  from  High  Resolution 
Limited  Area  Model  (H1RLAM)  [33-35],  The  Previento,  developed 
by  the  University  of  Oldenburg  in  Germany  has  a  similar  physical 
approach  but  uses  a  different  NWP  forecast  from  Lakelmodell  of 
the  German  Weather  Service  [36],  The  LocalPred  is  developed  by 
CENER  -  National  Renewable  Energy  Centre  in  Spain.  It  involves 
adaptive  optimization  of  the  NWP  forecast,  time  series  modeling, 
meso-scale  modeling  with  MM5,  and  power  curve  modeling  [37], 
The  eWind,  developed  by  AWS  TrueWind  Inc.  in  the  USA,  has  a 
similar  physical  approach  with  Prediktor  but  uses  a  high-resolution 
boundary  layer  model  (ForeWind)  as  a  numerical  weather  model 
to  take  the  local  conditions  into  account  [38], 

The  physical  approaches  are  based  on  the  models  using  the 
fundamental  physical  principles  for  conservation  of  mass,  momentum, 
and  energy  in  air  flows.  These  models  address  computational  fluid 
dynamics  (CFD)  for  simulating  the  atmosphere.  Although  there  are 
many  CFD  models  available,  they  are  all  based  on  the  same  basic 
physical  principles.  They  differ  in  how  the  grids  are  structured  and 
scaled,  and  how  the  numerical  computations  are  performed. 

In  the  majority  of  cases  the  statistical  approaches  provide  good 
results  in  short-term,  medium-term,  and  long-term  forecasting. 
However,  in  the  very  short-term  and  short-term  horizon,  the  influ¬ 
ence  of  atmospheric  dynamics  becomes  more  important,  and  in  these 
cases  the  use  of  physical  approaches  becomes  essential. 


2.1.2.  Statistical  forecasting  approach 

The  alternative  main  approach  for  wind  speed  and  power  fore¬ 
casting  is  based  on  statistical  modeling.  The  statistical  approach 


represents  the  relation  between  wind  power  or  speed  forecasting 
and  explanatory  variables  including  NWPs  and  on-line  measured  data 
[7[.  The  general  form  of  the  model  is  illustrated  in  Fig.  2. 

The  statistical  approach  generally  uses  previous  history  data  to 
build  the  statistical  model.  This  model  uses  NWP  forecast  for  time 
t+k  and  on-line  measurement  at  time  t  to  forecast  the  present 
over  the  next  few  hours.  It  is  easy  to  model  and  inexpensive. 
However,  contrary  to  the  physical  approach,  the  statistical 
approach  requires  historical  data  to  train  the  statistical  model. 
Many  different  approaches  are  employed  [2,3,5,6,11,13],  Some  of 
the  representative  statistical  approaches  are  reviewed  in  this 
section. 

2.2.2.I.  Conventional  statistical  approach.  In  the  conventional 
statistical  approach  a  time  series  model  is  applied  to  forecast 
future  wind  power  or  speed.  According  to  the  forecasting  process, 
which  was  proposed  by  Box-Jenkins,  this  model  is  divided  into 
four  main  steps  to  make  a  mathematical  model  of  the  problem 
including  model  identification,  model  estimation,  model 
diagnostics  checking,  and  forecasting.  Several  types  of  time 
series  models  may  be  considered,  including  autoregressive 
model  (AR),  moving  average  model  (MA),  autoregressive  moving 
average  model  (ARMA),  and  autoregressive  integrated  moving 
average  model  (ARIMA).  The  general  form  of  the  model  is 

Xt  =  c+E[+  2  tpiXc_i+  2  diet ~i  (1) 

i  =  1  i  =  1 

where  Xt  represents  the  forecasting  parameter  at  time  t,  (p{  is  the 
autoregressive  parameter,  #,■  is  the  moving  average  parameter,  c  is 
the  constant,  and  random  variable  et  is  the  white  noise.  This  model 
represents  the  ARMA  model  having  the  autoregressive  model  of 
order  p  and  the  moving  average  model  of  order  q  (ARMA(p,q)).  If 
the  order  of  the  moving  average  model  (q)  is  zero,  it  represents  the 
autoregressive  model  of  order  p  (AR(p)).  If  the  order  of  the 
autoregressive  model  (p)  is  zero,  it  represents  the  moving 
average  model  of  order  q  (MA(q)).  The  ARIMA  model  is  a 
generalization  of  an  ARMA  model. 

The  ARMA  model  is  used  for  wind  power  forecasting  in  U.S. 
wind  farms  in  [39,40],  Some  of  the  models  offer  improvements 
over  the  persistence  model  but  some  are  not  by  changing  the 
order  of  AR  and  MA.  The  performance  of  the  model  is  highly 
dependent  on  the  parameters  of  the  model. 

The  Improved  Time  Series  Method  (ITSM)  based  on  ARIMA  is 
used  in  [41  ].  The  sub-wind  speed  series  obtained  from  the  wavelet 
decomposition  is  forecasted  and  then  the  aggregate  calculation  of 
the  sub-series  is  performed  to  obtain  the  final  wind  speed  forecast. 
The  simulation  results  indicate  that  the  proposed  method 
improves  the  accuracy  of  forecasting  over  the  classical  time  series 
model  and  the  neural  network  model. 

The  Autoregressive  Conditional  Heteroscedastic  (ARCH)  model 
is  combined  with  the  ARIMA  model  to  consider  the  heteroscedas- 
ticity  effect  of  the  residual  series  [42],  The  ARIMA- ARCH  model  is 
used  to  forecast  the  sub-wind  speed  series  obtained  from  the 
wavelet  decomposition.  The  final  wind  speed  forecasting  is  the 
sum  of  these  forecast  values.  The  results  show  that  it  can  improve 
the  accuracy  of  the  forecasting.  In  [43],  the  ARIMA-ARCH  model  is 
used  to  forecast  wind  speed  itself.  It  is  shown  that  the  ARIMA- 
ARCH  model  shows  better  performance  than  the  ARIMA  model. 

To  deal  with  the  non-stationary  of  wind  speed,  wind  speed 
data  are  transformed  to  a  Gaussian  distribution  and  standardized 
to  remove  the  non-stationary  in  [44],  The  parametric  AR  model  is 
then  applied  to  the  hourly  wind  speed  data  and  is  updated  during 
the  real-time  operation  by  a  recursive  least  squares  algorithm.  The 
method  is  validated  by  using  the  wind  speed  data  from  a  wind 
power  site. 
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Fig.  2.  The  statistical  approach  to  forecasting  wind  speed  and  power. 


In  [45],  the  ARMA  model  is  used  to  forecast  the  tuple  of  wind 
speed  and  direction.  Four  different  approaches  are  proposed  and 
their  performances  are  compared;  component,  traditional-linked 
ARMA,  vector  AR  (VAR),  and  restricted  VAR  approaches.  The 
component  approach  is  better  than  traditional-linked  ARMA  for 
wind  direction  forecasting  but  not  for  wind  speed  forecasting.  The 
VAR  approach  offers  better  performance  in  wind  direction  and 
close  performance  in  wind  speed  comparing  with  traditional- 
linked  ARMA.  There  is  little  difference  in  terms  of  forecasting 
performance  between  VAR  and  restricted  VAR  approaches. 

The  AR  model  using  a  Bayesian  approach  is  used  to  forecast  the 
wind  speed  in  [46],  First,  Box-Cox  transformation  is  performed  to 
correct  the  non-normality  of  the  wind  speed.  Then,  the  Markov 
Chain  Monte  Carlo  (MCMC)  simulation  is  performed  to  estimate 
the  AR  model  parameters,  which  can  then  be  used  to  forecast  wind 
speeds.  It  shows  the  great  potential  and  possible  improvements  of 
a  Bayesian  framework. 

In  summary,  conventional  statistical  approaches  are  based  on 
classical  linear  statistical  models  such  as  AR,  MA,  ARMA,  and  the 
Box-Jenkins  approach,  where  Box -Jenkins  is  based  on  ARIMA  or 
seasonally  adjusted  ARIMA  models,  also  known  as  SARIMA  mod¬ 
els.  However,  the  references  show  that  the  forecasting  accuracy 
can  be  improved  depending  on  the  model  parameters  that  are 
selected. 

The  conventional  statistical  approaches  are  mostly  aimed  at 
very  short-term  and  short-term  forecasting.  These  models  are  easy 
to  formulate  and  are  capable  of  providing  timely  forecasts.  As 
forecasting  accuracy  is  improved,  conventional  statistical 
approaches  are  often  used  as  a  reference  model. 


2.I.2.2.  Artificial  Neural  Network  approach.  Another  approach  is 
Artificial  Neural  Networks  (ANN).  The  NWP  forecasts  and  further 
meteorological  variables  are  transformed  into  the  wind  power  or 
speed  forecast  by  ANN  which  has  been  trained  by  the  large  sets  of 
historical  data  in  order  to  learn  the  dependence  of  the  output  on 
input  variables.  The  general  ANN  approach  for  wind  speed  and 
power  forecast  is  shown  in  Fig.  3. 


ANN  is  one  of  the  widely  used  statistical  approaches  for  wind 
speed  and  power  forecasts.  It  consists  of  an  input  layer,  one  or 
more  hidden  layers,  and  an  output  layer.  Each  layer  has  a  number 
of  artificial  neurons,  and  it  uses  a  connectionist  approach  to 
connect  the  neurons  to  the  neurons  of  the  previous  layer.  This 
approach  is  able  to  model  the  complex  non-linear  relationship 
between  the  input  and  output  layers  through  a  training  and 
learning  process.  This  approach  does  not  require  explicit  mathe¬ 
matical  expressions  as  used  in  the  physical  and  statistical 
approaches  reviewed  previously.  Furthermore,  it  has  the  ability 
of  self-learning,  self-organizing  and  self-adaption. 

In  [47],  the  ANN  methodology  using  the  Markov  Chain 
approach  is  proposed  for  very  short-term  wind  speed  forecast. 
The  Markov  Chain  is  applied  to  modify  the  preliminary  forecast 
obtained  from  ANN  according  to  the  long-term  patterns.  A  data  set 
with  2.5  s  resolution  is  used  to  evaluate  the  performance.  The 
results  show  the  effectiveness  of  the  proposed  method. 

The  ANN  methodology  using  Particle  Swarm  Optimization 
(PSO)  is  proposed  for  short-term  wind  power  forecast  in  [48], 
PSO  is  employed  to  select  the  input  variable  from  the  several 
nearby  locations.  The  proposed  method  shows  the  reduction  of  the 
forecast  error  comparing  with  ANN. 

Furthermore,  the  forecast  method  composed  of  a  new 
Enhanced  Particle  Swarm  Optimization  (EPSO)  technique  and 
Modified  Hybrid  Neural  Network  (MHNN)  is  proposed  to  have 
high  learning  capability  and  avoid  the  overfitting  problem  and 
trapping  in  local  minima  and  dead  bands  in  [49],  The  effective 
two-stage  feature  selection  technique,  enhanced  Mutual  Informa¬ 
tion  (Ml),  is  applied  to  select  the  most  informative  candidate 
inputs  by  filtering  out  both  irrelevant  and  redundant  candidate 
inputs  for  the  forecast  method.  The  results  confirm  the  validity  of 
the  developed  approach. 

In  [50],  the  forecast  method  based  on  Ridgelet  Neural  Network 
(RNN)  is  proposed  as  the  forecast  engine.  The  inputs  of  the  forecast 
engine  are  selected  by  the  Mutual  Information  (MI)  based  feature 
selection  component  among  the  set  of  candidate  inputs.  A  New 
Differential  Evolution  (NDE)  algorithm  with  novel  crossover  opera¬ 
tor  and  selection  mechanism  is  proposed  to  train  the  RNN.  The 
results  show  the  effectiveness  of  the  proposed  method. 
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Fig.  3.  ANN  approach  for  wind  speed  and  power  forecast. 


In  [51],  a  comprehensive  comparison  study  of  typical  three 
types  of  ANN  approach  is  performed;  Feed  Forward  Back- 
Propagation  (FFBP),  Radial  Basis  Function  (RBF),  and  Adaptive 
Linear  Element  (ADAL1NE)  approaches.  It  is  found  that  different 
inputs  and  learning  rates,  as  well  as  model  structures,  directly 
influence  the  forecast  accuracy.  Furthermore,  none  of  the 
approaches  outperform  others  universally  based  on  the  evaluation 
criteria  even  for  the  same  wind  dataset. 

Complex-Valued  Neural  Network  (CVNN)  is  proposed  for  the 
wind  power  forecast  in  [52],  The  CVNN  used  the  wind  vector  (both 
wind  speed  and  direction)  rather  than  real-valued  data  as  an 
input.  The  results  show  the  improvement  in  the  forecast  accuracy 
than  Real-Valued  Neural  Network  (RVNN). 

In  [53],  an  ANN  approach  in  conjunction  with  the  adaptive 
Bayesian  learning  and  the  Gaussian  process  approximation  is 
proposed  for  very  short-term  wind  power  forecast.  The  Bayesian 
framework  is  employed  updating  to  calculate  the  posterior  prob¬ 
ability  distribution  for  the  weights  of  the  model.  The  Gaussian 
process  approximation  makes  the  calculation  of  this  posterior 
probability  possible  to  solve  an  integration  problem  of  the  Baye¬ 
sian  learning.  The  Bayesian  framework  provides  not  only  point 
forecast  but  also  the  interval  about  the  forecast. 

A  three-layered  Feed-forward  NN  trained  by  the  Levenberg- 
Marquardt  algorithm  is  considered  to  forecast  the  wind  power 
sub-series  by  Wavelet  Transform  (WT)  in  [54],  The  obtained 
forecast  sub-series  reconstruct  the  future  wind  power  series  by 
inverse  WT.  The  proposed  approach  is  compared  with  the  refer¬ 
ence  to  demonstrate  its  effectiveness  regarding  forecasting  accu¬ 
racy  and  computation  time.  Similarly,  in  [55],  Adaptive  Wavelet 
Neural  Network  (AWNN)  is  carried  out  to  forecast  the  wind  speed 
in  stage-I.  Then,  Feed-forward  Neural  Network  (FNN)  is  used  to 
transfer  the  wind  speed  forecast  to  the  wind  power  forecast  in 
stage-II. 

In  [56],  standard  multilayer  Feed-forward  Neural  Network 
(FNN)  is  employed  to  forecast  the  wind  speed.  The  non-linear 
and  non-stationary  wind  speed  series  is  first  decomposed  into  a 
finite  and  often  small  number  of  Intrinsic  Mode  Functions  (IMFs) 
and  one  residual  series  to  obtain  the  stationary  data  series  using 
the  Empirical  Mode  Decomposition  (EMD).  FNN  is  performed  to 
forecast  the  decomposed  sub-series  except  the  high  frequency, 
whose  input  variables  are  selected  by  Partial  Autocorrelation 
Function  (PACF).  Finally,  the  forecast  sub-series  are  summed  to 
reconstruct  the  forecast  of  wind  speed  series.  The  results  show 


that  the  proposed  approach  outperforms  the  reference.  Further¬ 
more,  a  similar  approach  based  on  ANN  using  EMD  is  found 

in  [57], 

Generalized  Feed-forward  Neural  Network  (GFNN)  is  used  to 
forecast  the  annual  wind  speed  probability  density  distribution  in 
[58],  The  same  parameters  required  by  the  Weibull  function  are 
used  as  inputs.  The  proposed  method  shows  better  accuracy  to 
forecast  the  wind  speed  probability  density  distribution  than  the 
Weibull  function. 

Multilayer  Feed-forward  Neural  Network  (MFNN)  is  used  to 
forecast  a  day-ahead  wind  power  forecasting  in  [59].  The  NWP 
model  is  established  by  coupling  the  Global  Forecasting  System 
(GFS)  with  the  Weather  Research  and  Forecasting  (WRF)  System. 
Furthermore,  the  Kalman  filter  is  integrated  to  reduce  the  sys¬ 
tematic  error  from  WRF.  The  proposed  method  is  proved  to  be 
successful  by  the  simulation. 

In  [60],  Kernel  Ridge  Regression  (KRR)  with  active  learning  is 
performed  for  wind  speed  forecast.  The  active  learning  acts  as  a 
smart  sample  selector  capable  to  yield  compact  forecast  models 
and  also  to  filter  out  noisy  training  samples.  Three  active  learning 
methods  to  construct  the  training  set  for  KRR  are  proposed;  Pool 
of  regressors  (PAL),  Distance  from  the  closest  training  sample 
(DAL),  and  Residual  regression  (RSAL).  The  experimental  results 
show  that  smart  collection  of  training  samples  can  be  of  benefit  for 
wind  speed  forecast. 

ANN  has  been  a  good  selection  for  wind  speed  and  power 
forecasting.  ANN  is  able  to  model  a  complex  non-linear  relation¬ 
ship  and  extract  the  dependence  between  the  input  and  output 
through  the  learning  process.  It  is  simple  to  construct  and  only 
requires  short  development  times,  and  does  not  require  explicit 
mathematical  expressions.  ANN  can  be  designed  by  setting  up  the 
network  structure  and  then  selecting  a  learning  algorithm. 

The  first  step  with  ANN  is  to  choose  the  network  structure, 
where  network  structures  derive  from  two  basic  classes:  1  -  A 
feed-forward  topology  where  information  flows  in  only  one 
direction,  from  inputs  to  outputs;  2  -  A  recurrent  topology  where 
information  flows  in  both  directions,  from  input  to  output  and  also 
from  output  to  input. 

The  second  step  with  ANN  is  to  select  the  learning  algorithm 
and  to  learn  the  proper  response.  This  is  achieved  with  three 
learning  paradigms:  1  -  Supervised  learning  that  sets  ANN  para¬ 
meters  from  training  data;  2  -  Un-supervised  learning  that 
sets  ANN  parameters  based  on  given  data  and  a  cost  function;  3 
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-  Reinforcement  learning  that  sets  ANN  parameters,  where  data  is 
generated  by  interactions  with  the  environment.  Each  learning 
paradigm  has  many  learning  algorithms.  Some  commonly  used 
algorithms  include  evolutionary  methods,  gene  expression  pro¬ 
gramming,  simulated  annealing,  expectation-maximization,  non- 
parametric  methods,  and  particle  swarm  optimization. 

2.1.2.3.  ANN-Fuzzy  approach.  The  Fuzzy  logic  approach  is  a  non¬ 
linear  mapping  of  input  variables  into  the  output  by  using 
numerical  and  linguistic  values  Fuzzy  rather  than  fixed  and 
exact.  It  used  soft  linguistic  variable  (small,  medium,  and  large) 
and  a  truth  variable  that  ranges  in  degree  between  0  and  1.  The 
Fuzzy  logic  approach  is  employed  when  the  system  is  difficult  to 
model  accurately  but  an  inexact  model  is  available,  because  it 
allows  to  use  the  approximate  values  and  incomplete  or 
ambiguous  data.  Flowever,  using  the  Fuzzy  logic  approach  alone 
is  not  satisfying  because  of  its  weak  learning  ability. 

The  ANN-Fuzzy  approach  refers  to  combination  of  ANN  and 
Fuzzy  logic  approaches.  ANN  is  based  on  the  learning  and 
connectionist  structure  that  performs  well  in  low-level  computa¬ 
tion  with  raw  data.  However,  Fuzzy  logic  performs  well  with  the 
human-like  reasoning  in  high-level  computation.  Therefore,  the 
combination  of  two  approaches  offers  a  promising  approach  to 
forecast  the  wind  speed  and  power  by  compensating  their  weak¬ 
ness  each  other. 

In  [61],  ANN  is  combined  with  Fuzzy  logic  approach  in  order  to 
optimize  the  best  use  of  the  NWPs.  First,  the  ANN  model  provides 
the  preliminary  forecast  of  wind  power  based  on  NWPs.  Then,  the 
Fuzzy  model  estimates  the  quality  of  the  forecasts  provided  by 
NWPs.  Finally,  this  is  exploited  by  an  ANN  model  to  provide  the 
final  forecasts.  The  simulation  results  show  the  validity  of  the 
proposed  method. 

The  hybrid  Wavelet  Transform  (WT)-Particle  Swarm  Optimiza¬ 
tion  (PSO)-Adaptive  Network  based  Fuzzy  Inference  System 
(ANFIS)  approach  (hybrid  WPA  approach)  is  proposed  for  short¬ 
term  wind  power  forecasting  in  [62].  The  wavelet  transform  is 
performed  to  convert  a  wind  power  series  in  a  set  of  constitutive 
sub-series.  The  transformed  sub-series  are  separately  forecasted 
by  the  ANFIS.  The  PSO  is  used  to  adjust  the  parameters  of  the 
membership  function  in  ANFIS.  The  hybrid  WPA  approach  outper¬ 
forms  seven  other  reference  approaches. 

The  Simultaneous  Perturbation  Stochastic  Approximation 
(SPSA)-based  Multilayer  Feed-forward  Neural  Network  (MFNN) 
with  Fuzzy  inputs  algorithm  is  presented  in  [63],  The  SPSA  is 
employed  to  achieve  the  training  task  of  the  MFNN.  The  inputs  for 
the  MFNN  are  modeled  by  triangle  fuzzy  numbers.  A  better 
performance  of  the  proposed  method  is  shown  than  the  reference 
method.  However,  it  cannot  be  used  for  multistep  forecasting. 

ANN-Fuzzy  approaches  are  complementary  tools  in  building 
intelligent  forecasting  systems.  While  ANN  performs  well  in  low- 
level  computational  structures,  Fuzzy  logic  deals  with  reasoning 
incorporated  in  the  higher-level  computational  structures.  Mer¬ 
ging  of  the  ANN  approach  and  the  Fuzzy  logic  approach  into  one 
integrated  system  offers  a  promising  approach  to  building  wind 
speed  and  power  forecasting  systems. 

2. 1.2.4.  Etc.  A  Gaussian-Process-based  method  for  forecasting  the 
upper  and  the  lower  bounds  of  wind  speed  as  well  as  the  average 
is  proposed  in  [64],  It  makes  use  of  the  kernel  machine  technique  and 
the  Bayesian  estimation.  The  simulation  results  show  that  they 
provide  the  better  forecasting  in  consideration  of  the  uncertainty 
than  the  reference.  Furthermore,  the  advanced  Gaussian-Process- 
based  method,  called  the  Sparse  Heteroscedastic  Gaussian  Process 
(SPHGP)  is  proposed  in  [65].  This  method  is  developed  to  make  up  the 
misestimation  of  the  noise  level  from  standard  GP  and  reduces  the 


computational  costs  from  HGP.  The  ^]/2  regularizer  is  employed  for 
the  sparsification.  The  simulation  results  demonstrate  the  effective¬ 
ness  of  the  proposed  method. 

In  [66],  the  modified  Taylor  Kriging  (TK)  method  for  forecasting 
wind  speed  time  series  is  proposed.  The  original  TK  method  is 
developed  for  the  spatial  estimation.  In  this  paper,  the  original  TK 
method  is  properly  modified  for  the  time  series  forecasting.  The 
simulation  results  indicate  that  the  proposed  method  outperforms 
the  ARIMA  method. 

The  Grey  forecasting  model  (CM)  is  presented  for  short-term 
wind  speed  forecast  in  [67].  The  Grey  model  is  designed  for  the 
system  analysis  being  characterized  by  inadequate  information. 
The  GM(1,1)  Grey  model,  a  single  variable  first-order  Grey  model, 
is  used.  The  simulation  results  demonstrate  the  efficiency  of  the 
proposed  model. 

In  [68],  two-stage  hybrid  network  with  Bayesian  Clustering  by 
Dynamic  (BCD)  and  Support  Vector  Regression  (SVR)  is  proposed. 
First,  BCD  clusters  the  input  training  dataset  into  several  subsets 
with  similar  dynamical  properties.  Then,  SVRs  is  used  to  model  the 
training  data  in  each  subset  having  similar  property.  Experiments 
and  comparisons  with  the  persistence  method  demonstrate  the 
effectiveness  and  efficiency  of  the  proposed  method. 

The  forecasting  of  three-dimensional  wind  field  is  considered 
in  [69],  To  exploit  the  available  second  order  statistics,  the 
augmented  quaternion  statistics  is  employed  and  the  adaptive 
Widely  Linear  Quaternion  Least  Mean  Square  (WL-QLMS)  algo¬ 
rithm  for  the  forecasting  is  used  to  allow  for  both  three- 
dimensional  wind  model  and  the  fusion  of  atmospheric  para¬ 
meters.  It  is  shown  that  additional  information  from  the  vertical 
wind  speed  and  atmospheric  parameters  assist  in  providing  better 
forecast. 

In  [70],  the  Bayesian  Model  Averaging  (BMA)  approach  with 
Markov  Chain  Monte  Carlo  (MCMC)  is  proposed  for  developing  the 
long-term  wind  speed  distribution.  The  derived  BMA  probability 
density  function  (PDF)  is  an  average  of  many  candidate  models 
included  in  the  model  space  weighted  by  their  posterior  prob¬ 
abilities  over  the  sample  data.  The  case  studies  reveal  that  none  of 
reference  distribution  outperforms  others;  however,  the  BMA 
distribution  is  always  suitable  to  describe  the  wind  speed  dis¬ 
tribution  with  high  accuracy. 

The  forecasting  model  integrating  the  concepts  of  structural 
breaks  and  Bayesian  inferences  is  proposed  for  very  short-term 
wind  speed  forecasting  in  [71],  It  allows  prior  information  about 
the  wind  speeds  to  be  incorporated  into  the  model  and  somehow 
boosts  forecasting  performance  by  considering  structural  breaks. 
The  computational  results  confirm  the  advantages  of  the  proposed 
method.  However,  there  are  several  limitations;  for  example,  the 
computing  efficiency  is  not  high  and  the  size  of  training  sample  is 
not  large  enough.  Further  research  is  needed  to  cure  these 
limitations. 

In  [72],  First-order  and  Second-order  Adaptive  Coefficient  (FAC 
and  SAC)  methods  are  used  to  wind  speed  forecasting.  Particle 
Swarm  Optimization  (PSO)  is  employed  to  select  the  most  suitable 
parameter  of  FAC  and  SAC  methods.  Furthermore,  Seasonal  Expo¬ 
nential  Adjustment  (SEA)  is  employed  to  decompose  the  wind 
speed  data  into  seasonal  and  trend  components.  Different  combi¬ 
nations  of  these  techniques  are  simulated.  The  combination 
obtains  higher  forecasting  accuracy  than  the  single  FAC  and  SAC. 

Data  mining  approaches  in  wind  energy  system  have  more 
attention  in  recent  years  [12].  The  time  series  models  are  built 
with  data  mining  algorithms  for  the  short-term  forecasting  in  [73], 
Five  different  data  mining  algorithms  are  employed;  the  Support 
Vector  Machine  Regression  (SVMreg),  Multilayer  Perception 
(MLP),  M5P  tree,  Reduced  Error  Pruning  (REP)  tree,  and  the 
bagging  tree  algorithms.  The  computational  results  of  all  time 
series  models  and  data  mining  algorithms  are  compared  and 
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discussed.  Furthermore,  this  approach  is  applied  to  forecast  the 
wind  power  ramp  rates  in  [74].  One  of  the  disadvantage  of  the 
proposed  approach  is  that  the  time  series  model  uses  its  own 
previously  forecasted  value.  Therefore,  the  forecast  accuracy 
decreases  when  the  number  of  forecast  steps  increase.  In  [75],  a 
data  mining  approach  is  expanded  to  forecast  both  short-term  and 
long-term  period.  Furthermore,  an  advanced  model,  the  ANN 
approach,  is  employed.  The  simulation  results  show  that  the 
model  generated  by  an  ANN  outperforms  all  other  models  for 
both  short-term  and  long-term  forecast. 

The  frequency  domain  approach  is  introduced  for  characteriz¬ 
ing  the  wind  speed  patterns  in  [76].  This  approach  separates  the 
discrete  waveform  into  a  sum  of  sinusoids  of  different  frequencies 
from  wind  speed  patterns.  It  shows  a  potential  to  forecast  future 
wind  speed  patterns  with  better  accuracy  than  time  domain. 
Furthermore,  it  also  shows  a  potential  to  estimate  the  wind  speed 
patterns  for  a  target  location  using  available  data  from  the 
reference  site.  It  is  necessary  to  develop  the  appropriate  forecast¬ 
ing  system  using  the  components  obtained  from  the  frequency 
domain  approach. 


2.1.3.  Combination  approach 

The  basic  idea  of  the  combination  approach  is  to  combine 
different  approaches,  retaining  advantages  of  each  approach.  The 
desire  is  to  improve  the  forecast  accuracy,  but  combining  forecasts 
does  not  always  perform  better  than  the  best  individual  forecasts. 
However,  in  some  cases  it  is  viewed  as  less  risky  to  combine 
forecasts  than  to  select  an  individual  forecast  [77].  In  this  section, 
the  combination  forecasting  method  is  only  reviewed.  The  combi¬ 
nation  of  input  data  (NWPs)  is  discussed  in  Section  4. 

Several  approaches  have  been  developed  based  on  the  combi¬ 
nation  of  various  models.  The  ANN-Fuzzy  approach  discussed  in 
the  previous  section  is  one  of  the  combination  approaches.  In  [78], 
the  final  forecast  is  made  through  an  adaptive  linear  combination 
of  the  alternative  forecasts,  where  the  weights  given  to  each 
forecasting  methods  are  based  on  their  forecasting  performance. 
For  the  combination,  nine  different  time  series  models  are  con¬ 
sidered  by  varying  the  model  parameters.  The  proposed  combina¬ 
tion  shows  better  performance  than  individual  forecasts. 

The  Anemos  project  considers  the  combination  method,  called 
Adaptive  Exponential  Combination  (AEC)  in  [79].  In  the  first  step, 
several  combination  methods  are  used,  AEC  being  one  of  them.  In 
the  second  step,  the  AEC  method  is  used  to  combine  the  alter¬ 
native  combinations  of  the  first  step.  The  results  show  the 
usefulness  of  the  proposed  method. 

In  [80],  a  Multiple  Architecture  System  (MAS)  for  wind  speed 
forecasting  is  proposed.  The  different  regression  algorithms  are 
considered  as  candidate  models  for  making  up  the  ensemble 
forecast;  the  Multiple  Linear  (MLR)-based  regression,  the  Multi¬ 
layer  Perceptron  (MLP)  neural  networks,  Radial  Basis  Functions 
(RBF)  neural  networks,  and  Support  Vector  Machines  (SVM) 
regression.  Three  combination  strategies  are  explored;  simple 
average,  weighted  average,  and  non-linear  fusion  by  means  of  an 
ANN  method.  The  proposed  combination  strategies  improve  the 
performance  with  respect  to  the  single  forecast. 

In  [81],  two  types  of  univariate  statistical  forecasting  models 
based  on  the  Least  Squares-Support  Vector  Machine  (LS-SVM)  are 
proposed:  the  univariate  LS-SVM  model  and  the  hybrid  model  of 
ARIMA  and  LS-SVM  model.  The  hybrid  approach  is  designed  to 
capture  both  the  linear  ARIMA  modeled  and  the  non-linear 
patterns  modeled  by  LS-SVM.  However,  there  are  no  significant 
improvements  between  LS-SVM  and  the  hybrid  model. 

The  combination  approach  between  the  Radial  Basis  Function 
(RBF)  neural  networks  and  persistence  methods  is  considered  in 
[82],  The  experimental  results  show  that  the  RBF  method  is  better 


for  monotonically  wind  speed  changes  while  the  persistence 
method  is  more  suitable  for  random  data  with  white  noise.  The 
simulation  results  demonstrate  improvements  in  the  combination 
approach. 

In  [83],  two  combination  approaches,  namely  ARIMA-ANN  and 
ARIMA-SVM,  are  proposed.  The  ARIMA  method  is  employed  for 
modeling  the  linear  characteristics  and  the  ANN  or  SVM  methods 
are  employed  for  modeling  the  non-linear  characteristics.  The 
results  show  that  the  combination  is  viable  for  forecasting  both 
wind  speed  and  wind  power.  However,  the  combined  approaches 
do  not  always  outperform  the  single  method.  A  similar  combina¬ 
tion  approach  is  also  presented  in  [84], 

In  [85],  two  combination  approaches  are  proposed;  ARIMA- 
ANN  and  ARIMA-Kalman.  ARIMA  is  used  to  decide  the  best 
structure  of  the  ANN  model  and  to  obtain  the  best  initial 
parameters  of  the  Kalman  model.  Both  combination  approaches 
have  good  performance,  which  can  be  applied  to  non-stationary 
wind  speed  forecasting. 

The  Bayesian  Model  Averaging  (MBA)  approach  is  proposed  to 
combine  the  wind  speed  forecasts  obtained  from  different  ANN 
models  in  [86],  Three  different  ANN  models  are  considered; 
Adaptive  Linear  Element  (ADALINE)  network,  Back-Propagation 
(BP)  network,  and  Radial  Basis  Function  (RBF)  network.  The  BMA 
approach  weights  individual  forecasts  based  on  their  posterior 
model  probabilities,  with  the  better  performing  forecasts  receiving 
higher  weights  than  the  worse.  The  simulation  results  show  the 
effectiveness  of  the  proposed  combination  approach. 

In  [87],  the  combination  of  the  Soft  Computing  Models  (SCMs) 
and  Similar  Days  (SD)  methods  is  considered.  Three  SCMs  models 
are  considered,  including  Back-Propagation  Neural  Network 
(BPNN),  Radial  Basis  Function  Neural  Network  (RBFNN),  and 
Adaptive  Neuro-Fuzzy  Inference  System  (ANFIS).  The  average  of 
a  selected  number  of  similar  wind  speed  days  from  SD  methods  is 
used  for  the  input  of  SCMs  and  the  application  of  SCM  refines  the 
results  to  obtain  the  final  wind  speed  forecast.  The  simulation 
results  show  that  all  evaluated  SCMs  incur  some  level  of  perfor¬ 
mance  improvement  with  the  addition  of  the  SD  model.  Further¬ 
more,  the  SD-ANFIS  model  outperforms  all  other  individual  and 
combined  models. 

The  combination  between  Seasonal  ARIMA  (SARIMA)  and  Least 
Square  Support  Vector  Machine  (LSSVM)  is  developed  to  forecast 
the  average  monthly  wind  speed  in  [88],  The  SARIMA  model  is 
developed  by  using  training  data  and  the  LSSVM  model  is  devel¬ 
oped  to  describe  the  residuals  from  the  SARIMA  model.  The 
simulation  results  show  that  the  proposed  method  is  simple  and 
efficient. 

The  main  objective  of  the  combination  approach  is  to  improve 
performance  by  taking  advantage  of  the  strengths  of  each  model¬ 
ing  approach.  Combinations  can  consist  of  physical  and  statistical 
or  alternative  statistical  approaches.  The  advantage  of  combining 
results  from  different  modeling  approaches  is  improved  forecast 
accuracy.  Furthermore,  the  combination  of  models  reduces  risks 
during  extreme  events  such  as  storms,  where  some  model  types 
may  have  significant  forecasting  errors. 


2.2.  Spatial  correlation  forecasting 

Typically,  wind  speed  and  power  forecasting  refers  to  the 
future,  and  the  forecast  horizon  depends  on  power  system  opera¬ 
tion  requirements.  Spatial  correlation  forecasting,  on  the  other 
hand,  is  mostly  used  for  characterizing  the  wind  resource  at  the 
site  where  sufficient  information  is  not  available  but  a  neighboring 
measuring  station  is  available.  It  is  a  useful  indicator  when 
performing  wind  energy  potential  assessments  at  sites  without 
wind  measurements. 
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Several  approaches  can  be  found  regarding  the  problem  of 
spatial  correlation  forecasting.  The  most  commonly  used  approach 
is  the  Measure-Correlate-Predict  (MCP)  method  [89],  It  is  widely 
used  for  future  wind  integration  studies.  The  method  is  used  for 
the  long-term  wind  data  at  the  target  site  using  the  wind  data 
obtained  from  the  reference  site.  Four  different  MCP  methods  are 
introduced:  Linear  Regression,  Matric  MCP,  Weibull  Scale,  and 
Wind  Index  MCP  model.  It  is  found  that  all  perform  well,  but  the 
performance  is  of  course  dependent  upon  the  condition  of  the 
available  data. 

Furthermore,  in  [90],  the  MCP  method  is  used  to  forecast  the 
long-term  wind  speed  and  then,  the  Monte  Carlo  based  numerical 
simulation  is  performed  to  utilize  the  probability  models  at  the 
target  site.  This  model  is  used  for  assessing  the  uncertainty  of 
overall  wind  energy  potential  prior  to  the  construction  of  wind 
turbines.  For  the  case  study  it  was  found  that  the  method 
effectively  evaluated  the  expected  annual  energy  production  at 
the  site. 

The  Bayesian  hierarchical  model  is  developed  to  characterize 
the  wind  resource  at  the  desired  location  using  the  known 
resource  [91],  The  hierarchical  model  has  two  levels.  The  wind 
speed  data  from  the  reference  site  is  defined  as  the  sum  of  a 
temporal,  a  spatial,  and  error  components  together  at  the  first 
level.  At  the  second  level,  a  temporal  and  a  spatial  component  are 
modeled  as  a  first  order  random  walk  and  a  multivariate  normal 
distribution  respectively,  it  is  found  that  the  Bayesian  inference 
can  be  a  useful  tool  in  spatial  correlation  forecasting. 

In  [92],  the  wind  speed  at  “upwind"  remote  sites  can  be  used 
for  forecasting  at  "downwind”  sites.  The  ANN  approach  is  used  to 
forecast  the  wind  speed  at  the  target  site  using  the  “upwind”  and 
“downwind”  site  measurements.  It  is  found  that  the  short-term 
wind  speed  forecasting  is  improved  when  the  recent  measure¬ 
ments  from  the  reference  site  are  used. 

In  [93],  the  ANN  approach  is  used  to  estimate  the  wind  speed  at 
the  target  site  using  the  wind  speed  at  a  strong  correlation  site 
among  nearby  sites.  A  high  value  of  the  Sample  Cross  Correlation 
Function  (SCCF)  indicates  the  strongest  correlation  site.  The 
simulation  results  indicate  that  higher  SCCF  values  between  two 
sites  leads  to  better  wind  speed  estimates.  However,  this  method, 
contrary  to  other  methods,  requires  the  measurements  from  both 
the  target  and  reference  sites  to  calculate  their  SCCF. 

A  Takagi,  Sugeno,  and  Kang  (TSK)  Fuzzy  approach  is  proposed 
to  forecast  the  wind  speed  and  power  at  the  target  site  using  the 
wind  speed  and  direction  measured  at  neighboring  sites  in  [94], 
The  Genetic  Algorithm  (GA)  based  learning  scheme  is  employed  to 
achieve  the  training  task  of  the  model.  The  performance  is 
evaluated  at  various  terrain  cases  and  demonstrates  significant 
improvements  over  the  persistent  model. 

In  [95],  a  method  for  long-term  wind  speed  and  power 
forecasting  is  proposed  by  using  the  NWPs  provided  at  nearby 
sites  of  the  wind  park.  Three  types  of  local  recurrent  neural 
network  models,  depending  on  the  neuron  dynamic  model,  are 
employed:  the  Infinite  Impulse  Response  Multilayer  Perceptron 
(IIR-MLP),  the  Local  Activation  Feedback  Multilayer  Network  (LAF- 
MLN),  and  the  Diagonal  Recurrent  Neural  Network  (DRNN). 
Furthermore,  two  learning  schemes  for  updating  the  weights  of 
the  model  are  employed;  the  Global  Recursive  Prediction  Error 
(GRPE)  and  the  Decoupled  RPE  (DRPE).  It  is  shown  that  the 
proposed  methods  outperform  the  static  and  the  persistence 
models. 

The  ANN  approach  is  used  to  estimate  the  wind  speed  at  a 
target  station  in  [96],  The  wind  speeds  recorded  at  neighboring 
measuring  stations  are  used  as  signals  for  the  input  layer  of  the 
model.  It  is  found  that  as  the  number  of  reference  stations 
increase,  the  estimation  errors  tend  to  decrease.  Furthermore, 
when  the  wind  direction  is  used  as  an  input,  greater  error 


reduction  is  achieved  than  when  the  wind  direction  signal  is 
omitted. 

Being  different  from  traditional  wind  speed  and  power  fore¬ 
casting,  spatial  correlation  forecasting  takes  into  account  the 
spatial  relationship  of  wind  speed  and  power  from  different  sites. 
The  spatial  correlation  forecasting  requires  wind  speed  and  power 
measurements  from  multiple  spatial  correlated  sites,  where  the 
measurements  often  involve  time  delays.  The  measurements  and 
their  time  delays  adds  complexity  and  cost  to  the  implementation 
of  spatial  correlation  forecasting.  Spatial  correlation  forecasting 
can  either  employ  physical  models,  which  take  into  account  the 
terrain  information,  or  statistical  models,  which  take  spatial 
correlation  information  into  account. 

2.3.  Regional  forecasting 

Regional  forecasting  is  to  forecast  aggregated  power  output 
from  a  region  where  a  number  of  wind  turbines  are  sited.  The 
regional  forecasting  is  required  for  the  market  operator  to  estimate 
the  total  wind  power  production  in  a  region.  It  provides  a  faster 
approach  than  calculating  the  sum  of  the  wind  power  forecast 
from  each  wind  farm.  Furthermore,  it  is  more  accurate  because  of 
its  spatial  smoothing  effects  discussed  in  Section  4.  Several 
approaches  are  proposed  for  regional  forecasting. 

Linear  up-scaling  in  Previento  is  used  for  the  regional  forecast¬ 
ing  in  [36],  The  ratio  between  the  fluctuation  of  the  wind  power 
output  of  the  measured  single  site  and  the  ensemble  of  the  sites  is 
used  for  up-scaling.  It  shows  how  this  method  works  practically  in 
Germany. 

In  [97],  the  aggregated  forecasting  method  using  the  distances 
between  wind  speed  forecasts  for  a  set  of  selected  coordinates  to 
calculate  wind  power  forecasts  in  a  region  is  proposed.  Once  the 
distance  is  obtained,  the  wind  power  forecast  is  calculated  by 
comparing  the  new  wind  speed  forecast  with  the  stored  historical 
wind  speed  forecasts  and  their  corresponding  total  wind  power 
measured  in  the  region.  The  accuracy  of  the  proposed  method  is 
verified  by  comparing  the  results  with  the  reference  method. 

In  [98],  an  ANN-Fuzzy  approach  based  on  the  up-scaling 
approach  is  used  to  forecast  the  regional  wind  generation.  Three 
different  approaches  are  introduced  based  on  how  much  the 
NWPs  and  on-line  SCADA  data  are  available:  FNN  up-scaling, 
Cascaded  model,  and  FNN  cluster  model.  The  results  show  that 
the  overall  performance  is  improved  but  it  is  difficult  to  beat  the 
persistence  model  for  the  first  look-ahead  time.  SCADA  data  is 
necessary  to  enhance  the  performance  in  the  first  forecast  horizon. 

A  finite  state  Markov  Chain  model  is  developed  to  forecast  the 
aggregated  power  output  from  a  wind  farm  in  [99],  The  model 
takes  into  account  both  the  spatial  and  temporal  dynamics  of  wind 
power  output.  For  spatial  dynamics,  a  rigorous  step-by-step 
procedure  to  characterize  the  probability  distribution  of  the 
aggregated  power  output  is  presented.  The  temporal  dynamics  is 
characterized  using  auto-regression  analysis  tools. 

The  models  of  regional  forecasting  are  mostly  based  on  the  up- 
scaling  approach.  It  is  difficult  to  obtain  the  wind  power  measure¬ 
ments  and  NWPs  from  all  the  wind  farms  in  a  region.  Therefore, 
the  up-scaling  approach  is  used  to  scale  the  available  on-line 
measurements  and  NWPs  up  to  the  region.  There  is  not  much 
work  reported  dealing  with  regional  forecasting.  When  wind 
energy  penetration  increases  significantly  good  solutions  to  regio¬ 
nal  forecasting  will  be  needed. 

2.4.  Probabilistic  forecasting 

The  forecasting  models  discussed  so  far  provide  the  point 
forecast  for  how  much  wind  speed  and  power  can  be  expected 
for  a  number  of  hours  into  the  future.  It  is  important  to  know  not 
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only  the  point  forecast  but  also  its  expected  uncertainty.  Knowing 
the  uncertainty  of  the  forecast  enables  the  system  operator  to 
assess  the  risk  of  the  point  forecast.  Different  approaches  can  be 
found  in  the  literature  regarding  the  problem  of  probabilistic 
forecast.  In  this  section,  some  of  the  representative  methods  are 
discussed. 


2.4.1.  Parametric  approach 

If  there  is  any  underlying  assumption  on  the  distribution  one 
tries  to  model  for  the  forecasting  error,  it  is  a  parametric  approach. 

A  Beta  Probability  Density  Function  (PDF)  is  used  to  model  a 
PDF  of  the  wind  power  forecast  for  the  persistence  model  in  [100], 
First,  the  forecast  results  are  divided  into  power  classes  or  bins  for 
modeling  the  distribution  of  power  with  the  Beta  PDF.  The  overall 
forecast  error  PDF  is  obtained  from  adding  up  the  calculated  error 
PDFs  of  all  bins.  The  results  show  the  usefulness  of  the  proposed 
forecast  error  PDF  for  finding  the  optimal  storage  size. 

The  Cauchy  distribution  is  proposed  as  a  model  distribution  for 
the  forecast  error  for  the  persistence  model  in  [101].  The  shape  of 
the  distribution  is  found  to  change  significantly  with  the  forecast¬ 
ing  horizon.  The  effect  of  the  proposed  wind  power  forecast 
uncertainty  is  illustrated. 

There  is  little  literature  available  on  the  parametric  approach 
due  to  the  difficulty  of  defining  the  stochastic  nature  of  wind 
speed. 


2.4.2.  Non-parametric  approach 

Contrary  to  the  parametric  approach,  the  non-parametric 
approach  does  not  rely  on  any  assumption  about  the  distribution. 

A  non-parametric  approach  based  on  kernel  density  estimation 
is  proposed  to  provide  the  complete  wind  power  forecast  dis¬ 
tribution  in  [102],  The  distribution  of  wind  power  is  decomposed 
into  a  continuous  part  corresponding  to  all  occurrences  of  wind 
speed  having  zero  production  and  a  discrete  part  corresponding  to 
the  discrete  probabilities  of  all  production  values.  The  method 
levels  with  the  references  while  being  fast  and  producing  the 
complete  PDF. 

The  kernel  density  estimation  is  expanded  to  a  time-adaptive 
model  and  to  adopt  different  kernels  for  several  types  of  variables 
in  [103].  The  kernel  density  estimation  model  based  on  the 
Nadaraya-Watson  estimator  is  proposed  to  estimate  the  uncer¬ 
tainty  in  short-term  wind  power  forecasts.  It  includes  the  adoption 
of  specific  kernels  for  the  explanatory  variables  and  the  develop¬ 
ment  of  a  time  adaptive  model.  It  confirms  its  value  from  testing 
with  real  data  from  two  large  wind  farms  in  the  U.S.  and  shows 
better  results  based  on  the  evaluation  criteria  in  [104].  Further¬ 
more,  kernel  density  estimation  based  on  the  time-adaptive 
quantile-copula  estimator  is  presented  in  [105], 

In  [106],  the  generic  methodology  for  assessing  the  forecast  risk 
is  presented.  First,  confidence  intervals  based  on  the  resampling 
approach  are  calculated  by  considering  the  forecast  horizon,  the 
power  class  and  the  cut-off  risk.  Furthermore,  the  Meteorological 
Risk  Index  (MRI)  evaluating  the  weather  stability  is  used  to  tune 
the  confidence  intervals.  The  proposed  method  is  generic  and  can 
be  applied  to  any  wind  forecasting  models.  However,  it  does  not 
seem  to  have  very  much  influence  on  the  quantile  estimation  in 
[107],  The  most  influential  variable  is  the  forecasted  wind  power 
production.  Furthermore,  the  results  show  increased  uncertainty 
for  westerly  winds.  Also,  the  effect  of  the  horizon  on  the  quantiles 
is  minor. 

Probabilistic  wind  power  forecasts  using  Local  Quantile  Regres¬ 
sion  (LQR)  is  proposed  in  [108].  This  approach  does  not  require  the 
distribution  assumption  and  is  easy  to  include  more  predictive 
information.  The  simulation  results  demonstrate  its  effectiveness. 


In  [109],  a  probabilistic  forecasting  method  based  on  the 
Markov  Chain  model  is  proposed.  This  provides  not  only  the 
estimates  of  future  wind  power  generation  but  also  the  associated 
probability  distributions.  First  Order  Markov  Chain  (FOMC)  and 
Second  Order  MC  (SOMC)  are  considered.  The  proposed  models 
are  compared  with  the  persistence  model  to  evaluate  forecast 
errors. 

In  [110],  the  proposed  probabilistic  model  is  designed  to 
forecast  eight  quantiles  of  the  wind  power  distribution  using 
multiple  Radial  Basis  Function  Neural  Networks  (RBFNNs).  The 
main  idea  is  to  exploit  all  information  influencing  the  forecasting 
errors  in  order  to  estimate  the  wind  power  density  function. 
The  three  uncertainties  considered  are  the  weather  stability,  the 
structure  of  the  forecasting  model,  and  the  NWP  accuracy.  The 
developed  RBFNNs  are  trained  with  the  Ordinary  Orthogonal  Least 
Square  (OLS)  algorithm  and  their  performance  is  improved  using  a 
Particle  Swarm  Optimization  (PSO)  algorithm.  The  simulation 
results  show  that  the  proposed  method  performs  well  in  different 
weather  conditions  and  different  terrains. 

In  [111],  the  sparse  Warped  Gaussian  Process  (WGP)  model  is 
formulated  to  provide  short-term  probability  forecasts  of  the  wind 
power  generation.  It  converts  a  non-Gaussian  wind  power  series 
to  a  latent  series,  which  is  well  modeled  by  a  Gaussian  process. 
Furthermore,  a  sparsification  method  is  employed  to  reduce  the 
computational  costs  of  the  model.  The  simulation  results  validate 
the  effectiveness  of  the  proposed  model. 

Non-parametric  approaches  are  used  when  it  is  not  possible  to 
formulate  the  distribution  of  the  forecast  errors.  This  distribution- 
free  approach  is  appealing  since  it  is  difficult  to  define  the 
stochastic  nature  of  wind.  Furthermore,  the  wind  speed  and  power 
forecast  are  very  different  depending  on  the  forecast  time  horizon 
and  location.  Non-parametric  approaches  can  be  applied  in  this 
case,  and  as  a  result  are  suitable  to  estimate  the  uncertainty  of  the 
wind  speed  and  power  forecast. 


2.4.3.  Using  ensemble  forecast 

Ensemble  forecasting  methods  are  designed  to  take  advantage 
of  ensemble  NWPs.  The  ensemble  NWPs  are  obtained  by  running 
differently  calibrated  NWP  models  or  by  slightly  varying  the  initial 
conditions,  in  order  to  obtain  a  set  of  different  forecasts  [112], 
Different  types  of  meteorological  ensemble  forecasts  are  provided 
either  by  the  European  Centre  for  Medium-range  Weather  Fore¬ 
casts  (ECMWF)  or  the  National  Centre  for  Environmental  Predic¬ 
tion  (NCEP).  A  MultiScheme  Ensemble  Prediction  System  (MS-EPS) 
is  introduced  to  utilize  the  ensemble  NWPs  for  short-term  wind 
power  forecasting  [113], 

In  [114],  skill  forecasting  based  on  ensemble  forecasts  of  wind 
power  production  is  introduced  to  estimate  the  expected  level  of 
forecast  uncertainty.  Prediction  risk  indices  based  on  the  disper¬ 
sion  of  wind  power  ensembles  over  a  single  forecast  horizon  or 
over  a  set  of  successive  look-ahead  times  are  introduced.  Then, 
these  indices  are  used  to  forecast  the  distribution  of  expected 
forecast  errors  as  skill  forecasts.  The  simulation  results  show  that 
the  proposed  skill  forecasting  can  provide  useful  information  on 
the  expected  level  of  forecast  uncertainty. 

The  wind  power  density  forecasts  using  the  calibrated  and 
smoothed  ensemble  forecast  are  proposed  in  [115].  The  systematic 
bias  of  the  ensemble  forecasts  is  corrected  by  the  calibration 
approach  that  also  incorporates  kernel  smoothing,  with  para¬ 
meters  optimized  using  maximum  likelihood.  The  proposed  power 
density  forecasts  provide  more  accurate  results  for  wind  power 
density  and  point  forecasting  than  the  reference. 

The  ensemble  forecasts  give  a  much  better  idea  of  what 
weather  events  may  occur  in  the  future.  By  comparing  different 
forecasts,  it  identifies  the  expected  spread  of  weather  conditions 
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and  assesses  the  probability  of  particular  weather  events.  When 
the  forecasts  vary  significantly  there  is  a  lot  of  uncertainty. 
However,  if  forecasts  are  very  similar,  then  more  confidence  can 
be  placed  in  the  forecasts.  The  ensemble  forecast  is  useful  in  both 
wind  speed  and  power  forecasting. 

2.4.4.  Additional  models 

In  [116],  the  relation  between  typical  weather  situations  and  the 
corresponding  wind  speed  forecast  error  is  investigated  to  model 
the  uncertainty  of  the  forecast.  First,  the  Cluster  analysis  using  the 
Principal  Component  Analysis  (PCA)  rather  than  complex  meteorolo¬ 
gical  information  is  used  to  group  days  with  similar  meteorological 
conditions  into  classes.  Then,  for  each  of  these  groups  the  forecast 
error  is  determined  separately.  It  is  found  that  different  meteorological 
situations  have  significant  differences  in  the  forecasting  error. 

In  [117],  it  is  necessary  to  assess  both  temporal  and  spatial 
interdependences  in  order  to  evaluate  the  aggregate  error  for  the 
regional  uncertainty  estimates.  In  this  manner,  the  spatial  model¬ 
ing  of  the  wind  power  forecast  uncertainty  is  proposed,  which 
is  important  for  the  optimal  management  of  power  flows  in 
the  large  region.  The  improvement  of  the  probabilistic  forecasting 
is  discussed. 

The  spatio-temporal  modeling  of  wind  power  forecast  uncer¬ 
tainty  considering  the  interdependence  of  both  time  and  space  is 
proposed  in  [118],  It  is  shown  that  there  is  significant  cross¬ 
correlation  between  forecast  errors  obtained  from  neighboring 
areas  with  lags  of  a  few  hours.  These  cross-correlation  patterns 
characterized  by  wind  speed  and  direction  show  that  a  higher 
prevailing  wind  results  in  a  stronger  dependency  on  remote 
places;  while  a  lower  prevailing  wind  results  in  a  stronger 
dependency  on  local  places.  A  further  study  is  needed  for  the 
generalization  by  the  methodology. 

In  [119],  the  uncertainty  of  wind  power  forecasting  regarding 
the  meteorological  situation  is  investigated.  From  the  simulation 
results,  the  uncertainty  of  wind  speed  forecasts  is  not  dependent 
on  the  magnitude  of  the  wind  speed  forecast.  Furthermore,  the 
uncertainty  is  larger  for  low  pressure  situations  than  high  pressure 
situations.  However,  some  sites  do  not  show  these  differences. 

A  framework  for  evaluating  the  quality  of  probabilistic  fore¬ 
casting  of  wind  power  is  presented  in  [104].  The  framework  is 
composed  of  measures  and  diagrams,  with  the  aim  of  providing 
useful  information  on  each  of  the  properties  including  reliability, 
sharpness  and  resolution,  and  a  unique  skill  score.  The  framework 
is  applied  for  comparing  the  quality  of  two  non-parametric 
methods;  adaptive  quantile  regression  and  adapted  re-sampling. 
The  relevance  and  interest  of  the  proposed  framework  are 
discussed. 

In  [120],  a  methodology  for  using  the  variability  in  wind  speed 
forecasts  at  multiple  grid  points  in  NWP  systems  is  presented  to 
forecast  the  uncertainty  in  future  wind  power  generation.  The 
basic  concept  is  that  a  major  source  of  uncertainty  in  NWP 
forecasts  arises  from  potential  misplacement  of  weather  features. 
The  comparison  between  the  proposed  method  and  NWP  ensem¬ 
bles  is  performed.  It  demonstrates  good  accuracy  in  forecasting 
large  forecast  errors. 

A  wide  comparison  is  carried  out  between  the  ARMA  models, 
five  kinds  of  ANN  models,  and  Adaptive  Network  based  Fuzzy 
Inference  System  (ANFIS)  models  in  [121  ].  The  comparison  looks  at 
many  forecasting  methods,  time  horizons  and  a  performance 
analysis.  A  further  analysis  of  the  statistical  distribution  of  normal¬ 
ized  errors  from  the  forecasting  models  is  performed. 

2.5.  Offshore  forecasting 

Nowadays,  offshore  wind  has  a  lot  of  attention  and  is  expected 
to  be  installed  more  in  the  future  because  of  more  frequent  and 


powerful  winds.  However,  the  offshore  wind  forecasting  is  still  in 
the  early  phases  of  the  development.  There  are  not  many  available 
specific  forecasting  models  for  offshore  in  the  literature.  Further 
advances  in  offshore  forecasting  will  help  offshore  wind  penetra¬ 
tion  levels. 

Most  forecasting  models  have  been  designed  for  onshore.  For 
the  physical  approach  to  offshore  forecasting,  the  special  meteor¬ 
ological  characteristic  of  the  marine  atmospheric  boundary  layer 
has  to  be  considered  including  the  stability  of  the  boundary  layer, 
wind-wave  interaction,  etc.,  [122],  However,  the  statistical 
approach  does  not  need  to  consider  the  precise  offshore  condi¬ 
tions.  Results  from  various  offshore  forecasting  methods  are 
investigated  in  [123], 

In  [124],  the  evaluation  of  NWPs  forecasts  for  offshore  sites  is 
performed.  Then,  for  the  physical  forecasting  approach,  the  special 
meteorological  characteristics  of  the  marine  boundary  layer  are 
considered  to  calculate  the  wind  speed  at  the  hub  height  of  the 
wind  turbines  by  using  the  derived  vertical  wind  profile.  The 
effectiveness  of  the  proposed  method  is  demonstrated  by  compar¬ 
ing  with  measurements. 

In  [61,114,125,126],  some  of  the  statistical  approaches  use 
offshore  data  as  an  input,  but  the  specific  technique  for  dealing 
with  offshore  data  is  not  discussed. 

3.  Accuracy  of  wind  speed  and  power  forecasting 

3.3.  Reference  models 

A  number  of  reference  models  have  been  introduced  as  bench¬ 
mark  models.  These  models  may  be  used  for  initial  testing  of  the 
accuracy  of  new  approaches  to  wind  forecasting.  A  number  of 
existing  reference  models  are  reviewed  in  this  section. 

3.3.3.  Persistence  model 

The  persistence  model  is  a  commonly  used  benchmark  model. 
In  this  model  the  future  wind  power  will  be  the  same  as  occurred 
in  the  present  time  step  as  given  by 

Pt+k\t  =  Pt  (2) 

where  Pt+qt  is  the  forecast  at  time  t  for  the  look  ahead  time  k  and 
P[  is  the  measurement  at  time  t.  This  model  performs  well  for 
short  forecasting  horizons. 

3.3.2.  Weighted  sum  between  persistence  and  the  mean  power 
production 

The  persistence  model  performs  well  for  the  short-term 
forecast.  However,  it  is  not  reasonable  to  use  the  persistence 
model  when  the  forecast  time  horizon  is  more  than  a  few 
hours.  For  this  case,  a  new  reference  model  has  been  proposed 
[127],  It  is  a  weighted  sum  between  the  persistence  and  the  mean 
given  as 

Pt  +  k\t  =  Of[f>r  +  (l  —  ak)P  (3) 

where  Pt  is  the  measurement  at  time  f,  P  is  the  estimated  mean, 
and  ak  is  the  correlation  coefficient  between  Pt  and  Pt+k  defined  as 

fWt.  ,4, 

where 

Pt  =  Pt-P  (5) 

3.3.3.  OL-Persistence  model 

For  regional  forecasting,  total  regional  power  is  seldom  avail¬ 
able.  Therefore,  the  On-Line  (OL)  Persistence  model  is  introduced 
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for  the  regional  forecasting  as  a  benchmark  model  [32].  This  model 
is  the  sum  of  the  production  of  the  representative  wind  farms  with 
measurements,  scaled  to  the  total  wind  power  and  is  given  as 


hreg  _ 

1  t+k\t  ~ 


preg 
1  inst 

ZMr  pwf 

i  —  1  ^ inst,i 


(6) 


Where  Pimt  is  the  total  installed  capacity  in  the  region,  P^t  j  is  the 
installed  capacity  of  the  Mr  wind  farms  for  which  on-line  mea¬ 
sures  are  available,  and  is  the  power  measured  at  those  wind 
farms  at  time  t. 


3.2.  Model  performance  evaluation 

To  determine  the  model  performance  in  terms  of  the  forecast¬ 
ing  accuracy,  several  model  accuracy  measures  are  discussed  in 
this  section. 

The  forecast  error  is  the  difference  between  the  measurement 
and  the  forecast  and  is  given  as 

er+k\t  =  ^r+k  —  (7) 

Sometimes,  the  normalized  forecast  error  is  used.  The  forecast  error 
is  normalized  by  the  installed  capacity  (Pinst)  as 

£t+k\t =  n  er+k|f  (8) 

1  inst 

To  assess  the  quality  of  the  forecast,  several  evaluation  criteria 
are  used;  Bias,  Mean  Absolute  Error  (MAE),  Mean  Absolute  Per¬ 
centage  Error  (MAPE),  Mean  Square  Error  (MSE),  Root  Mean  Square 
Error  (RMSE),  and  Standard  Deviation  of  Errors  ( SDE )  [128,129],  as 
given  by 


1  n 

Bwsk=Avg(et+klt)  =  ek  =  1-  £  et+klt  (9) 

t=  1 

MAEk  =  l  I  |et+qt|  (10) 

iV  t=  1 

MAPEk  =  l-  Z  to  OD 

Nt=l  Pt  +  k 

MSEk  =  X  i^t+k\t)  ("12) 

RMSEk  =  v7 MSTk  =  ^  (et+k,t)2  03) 

SDEk  =  Std(ec+klc)  =  J ^  Z^et+kit~ek)2  (14) 


where  N  is  the  total  number  of  samples,  and  Avg(  ■ )  and  Std{  ■ ) 
denote  the  average  and  the  standard  deviation  respectively. 
Furthermore,  the  RMSEk  can  be  expressed  by  the  combination 
between  Biask  and  the  SDEk  as 


3.3.  Forecast  accuracy 

Using  the  advanced  forecasting  models  introduced  in  Section  2, 
it  is  possible  to  provide  better  forecasts  of  wind  speed  and  power 
than  the  reference  models.  However,  it  is  very  difficult  to  say 
which  model  is  the  best  because  of  its  site  dependency.  Although 
the  forecasting  model  performs  well,  it  does  not  guarantee  that 
the  model  works  well  at  another  site.  Therefore,  how  the  general 
forecast  accuracy  varies  is  discussed  in  this  section. 

3.3.1.  The  accuracy  by  forecast  time  horizon 

The  forecast  accuracy  decreases  when  prediction  length 
increases.  The  MAE  of  short-term  forecasts  is  typically  in  the  range 
of  5-15%  and  then  the  errors  increase  rapidly  with  an  increase  in 
the  forecast  time  horizon.  Thus,  the  MAE  is  typically  in  the  range 
of  13-21%  range  for  1-2  days  ahead  and  rises  to  20-25%  range 
after  about  3  days  [10], 

3.3.2.  The  accuracy  by  location  complexity 

In  [79],  the  average  value  of  the  normalized  MAE  (NMAE)  for 
the  12  h  forecast  horizon  is  compared  based  on  the  terrain 
complexity  of  the  wind  farm.  Much  smaller  NAME  values  in  a  flat 
terrain  wind  farm  (low  complexity  location)  are  observed  than  in 
complex  terrains  (high  complexity  location).  There  is  a  significant 
increase  of  the  NMAE  value  when  the  terrain  complexity  is 
increased.  Furthermore,  offshore  wind  farms  have  slightly  higher 
NMAE  values  relative  to  the  flat  terrain  wind  farm. 

3.3.3.  The  accuracy  by  seasonal  variability 

In  [130],  the  performance  of  the  forecasting  models  is  related  to 
the  seasonal  variability.  Smaller  forecasting  errors  are  observed  in 
winter  than  summer  because  of  the  higher  level  of  wind  speeds 
and  the  larger  uncertainty  in  the  summer  stormy  weather  situa¬ 
tions,  in  particular  low  pressure  systems  with  fast  moving  frontal 
zones  containing  complicated  wind  patterns. 

3.3.4.  The  accuracy  by  weather  condition 

In  [106],  the  forecasting  error  is  compared  as  a  function  of  the 
Meteorological  Risk  Index  (MRI)  evaluating  the  weather  stability. 
The  forecasting  error  increases  linearly  when  MRI  increases  (the 
higher  MRI  represents  unstable  weather  regimes).  Furthermore,  in 
[119],  the  forecasting  uncertainty  does  not  depend  on  the  wind 
speed  but  on  the  pressure.  The  forecasting  error  in  the  low 
pressure  situation  is  larger  than  the  high  pressure  situation.  These 
patterns  are  also  observed  in  [116], 


4.  Improvement  in  forecasting  performance 

Development  activities  are  on-going  in  reducing  the  forecasting 
error  in  wind  speed  and  power  forecasting,  spatial  correlation 
forecasting,  regional  forecasting,  probabilistic  forecasting,  and  off¬ 
shore  forecasting.  In  this  section,  some  of  the  potential  options  to 
help  improve  the  forecasting  accuracy  are  discussed. 


RMSE2k  =  Bias2k+SDE2k 


(15) 


In  the  model  comparison,  it  is  important  to  quantify  the 
improvements  of  the  advanced  model  over  the  benchmark  model. 
In  this  manner,  the  improvement  with  respect  to  the  reference 
model  is  introduced  as  [128,129] 


ImpEC  t  — 


ECrefk-ECa, 


EC, 


ref.k 


(16) 


where  EC  stands  for  the  evaluation  criteria  such  as  MAE,  RMSE, 
SDE,  etc. 


4.3.  Kalman  filtering 

It  is  well  known  that  the  accuracy  of  NWPs  has  a  major  impact 
on  the  forecasting  of  wind  speed  and  power.  Therefore,  reducing 
the  uncertainty  of  NWPs  can  lead  to  significant  forecasting 
improvements.  To  improve  the  accuracy  of  NWPs,  the  Kalman 
filtering  algorithm  is  applied  to  filter  out  systematic  errors.  The 
Kalman  filtering  algorithm  provides  the  statistically  optimal  esti¬ 
mate  recursively  by  combining  recent  weighted  observations 
which  minimize  the  corresponding  biases. 
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In  [131],  the  improvements  in  wind  speed  forecasting  with  the 
application  of  the  Kalman  filtering  are  investigated.  Third-order 
polynomial  functions  are  implemented  in  the  Kalman  filtering 
algorithm.  For  the  evaluation,  two  NWP  models  having  different 
characteristics  and  horizontal  resolutions  are  tested.  The  simula¬ 
tion  results  show  that  the  forecasting  accuracy  is  significantly 
improved  by  using  the  filtered  NWPs.  The  significant  forecast 
improvement  using  the  Kalman  filtering  can  be  also  found  in 
[59,132,133], 


4.2.  Optimal  combination 

The  combination  of  different  NWPs  or  forecasting  models  takes 
advantage  from  each  model.  The  advantage  of  combining  the 
models,  of  course,  improves  the  forecast  accuracy.  Furthermore, 
the  combination  can  reduce  the  risk  from  extreme  events.  The 
combination  of  forecasting  methods  is  discussed  in  Section  2,  and 
the  optimal  combination  can  significantly  reduce  forecast  error. 
Here,  the  optimal  combination  of  NWPs  is  discussed. 

In  [134,135],  a  classification  scheme  is  used  to  find  optimal 
combinations  for  specific  weather  situations.  The  different  NWP 
models  are  exploited  by  applying  the  optimal  weights  depending 
on  the  prevailing  weather  conditions.  It  shows  that  the  optimal 
combination  of  NWPs  significantly  outperforms  others  even  if  the 
single  NWP  is  already  very  well  optimized.  The  combination  of 
different  NWPs  is  performed  based  on  the  quality  of  individual 
forecasts  in  [125,136], 

In  [137],  the  combination  between  measured  data  and  multiple 
wind  forecasting  models  is  proposed  by  advanced  machine  learn¬ 
ing  techniques.  The  input  sources  are  the  wind  forecasts  profiles, 
derived  by  synoptic  and  local  atmospheric  models,  and  the  actual 
climatic  variable  evolution  measured  by  local  environmental 
stations.  The  simulation  results  demonstrate  the  effectiveness  of 
the  proposed  data  fusion  algorithms. 


4.3.  Spatial  smoothing  effects 

Under  operational  conditions,  regional  forecasting  is  required 
for  the  system  operator  to  estimate  the  total  wind  power  produc¬ 
tion.  The  fluctuations  of  the  combined  power  output  of  distributed 
wind  farms  are  damped,  which  results  in  decrease  in  fluctuations 
of  the  regional  power  output  compared  to  the  forecast  for  single 
sites.  This  is  called  spatial  smoothing  effects.  These  effects  provide 
the  motivation  for  developing  regional  forecasting  models  rather 
than  calculating  the  sum  of  the  wind  power  forecast  from  single 
sites.  Several  forecasting  models  are  discussed  in  Section  2.3.  The 
reduction  of  the  forecast  error  is  found  in  the  literature  [138,139], 


4.4.  Statistical  downscaling 

NWPs  are  primary  inputs  to  forecasting  models.  So  improving 
NWPs  can  help  to  improve  wind  speed  and  power  forecasting. 
NWPs  are  only  available  for  grid  points  which  generally  corre¬ 
spond  to  areas  that  are  larger  than  the  wind  farms  themselves. 
Therefore,  estimating  the  wind  speed  at  the  wind  farm  location 
can  provide  more  accurate  wind  speed  forecast  as  an  input  to  the 
forecasting  models.  The  basic  idea  of  statistical  downscaling  is  to 
use  higher  resolution  calculations  by  adding  more  complete 
physics  descriptions  to  capture  more  precise  local  conditions. 
The  analysis  results  show  that  wind  power  forecasting  using 
the  statistical  down  scaled  NWPs  can  improve  the  performance 
[140-142],  Furthermore,  some  of  the  advanced  NWP  models  are 
examined  in  [143-145], 


4.5.  Input  parameter  selection 

The  selection  of  the  input  parameters  for  the  forecasting  model 
is  crucial  to  the  performance  of  the  forecasts.  Several  input 
parameters  are  investigated  in  the  literature  for  the  purpose  of 
reducing  the  forecasting  error. 

In  [146],  the  inclusion  of  off-site  observations  with  NWP 
models  can  increase  the  forecast  accuracy.  Despite  the  inherent 
large  forecast  error  from  NWPs,  adding  off-site  observations  leads 
to  increases  in  forecast  accuracy.  Furthermore,  as  the  number  of 
reference  stations  providing  the  measurements  increase,  the 
forecasting  errors  tend  to  decrease  [96],  The  improvements  by 
the  off-site  observations  can  be  the  motivation  for  developing 
spatial  correlation  forecasting  which  forecasts  wind  speed  and 
power  at  target  sites  by  using  the  observations  from  neighboring 
measuring  stations,  as  described  in  Section  2.2. 

Several  meteorological  parameters  as  inputs  have  been  con¬ 
sidered  for  improving  the  performance  of  the  forecast.  The 
inclusion  of  wind  direction  with  wind  speed  can  reduce  the 
forecasting  error  [96,116,118,126],  Sometimes  the  wind  vector 
applying  both  wind  speed  and  direction  is  used  [45,52,69],  Besides 
the  obvious  importance  of  the  wind  speed  as  an  input  parameter, 
temperature  and  pressure  can  help  improve  forecasts  [69,116,147], 

In  [148],  the  inclusion  of  the  wind  speed  forecast  at  wind 
turbine  hub  height  shows  significant  improvements  in  forecast 
accuracy. 

Furthermore,  transformed  data  rather  than  wind  speed  itself 
show  potential  for  improving  the  forecast.  Several  transformation 
techniques  are  used:  wavelet  decomposition  [41,42,54,55,62], 
empirical  mode  decomposition  [56,57],  and  frequency  domain 
approach  [76], 

4.6.  Power  curve  modeling 

When  converting  the  wind  speed  to  wind  power  production, 
inaccuracies  in  the  non-linear  relation  leads  to  further  errors. 
Wind  power  and  wind  speed  are  related  through  a  cubic  relation¬ 
ship.  Thus,  small  variations  in  the  wind  speed  result  in  much 
larger  deviations  in  the  wind  power.  The  use  of  the  certified 
manufacture  power  curves  does  not  guarantee  accurate  conver¬ 
sions.  Using  an  accurate  relation  between  the  wind  speed  and 
power  can  greatly  minimize  conversion  errors  when  converting 
the  wind  speed  forecast  to  the  wind  power  forecast. 

The  conventional  wind  power  curve  model  is  extended  to  a 
wind  farm  model  by  the  addition  of  the  wind  direction  depen¬ 
dence  with  the  wind  speed  dependence.  Such  a  two  dimensional 
wind  farm  power  curve  model  may  help  to  provide  more  accurate 
relations  between  wind  speed  and  power  [50,140,142], 

Statistical  techniques  rather  than  the  wind  power  curve  may  be 
used  to  describe  the  non-linear  relation  between  wind  speed  and 
wind  power.  The  polynomial  regression  is  used  to  estimate  the 
wind  power  by  using  the  explanatory  variables  such  as  wind 
speed,  direction,  etc.  Furthermore,  non-linear  hyperbolic  functions 
containing  trigonometric  terms  which  describe  the  non-linear 
behavior  are  used  [133].  Sometimes,  advanced  statistical 
approaches  (ANN,  Fuzzy  logic,  etc.)  are  used  to  estimate  wind 
power  from  the  various  inputs  [55],  However,  such  a  relation  does 
not  need  to  be  described  when  the  wind  power  is  forecasted 
directly  from  the  raw  input  data.  However,  it  is  required  when 
forecasting  the  wind  speed  and  then  converting  it  to  the  power. 

4.7.  Forecasting  parameter 

One  of  the  most  important  decisions  in  a  wind  power  system 
is  choosing  which  parameters  will  be  used  in  the  forecast.  The 
forecaster  can  choose  the  wind  speed  alone  or  the  wind  speed  and 
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direction  or  the  wind  vector  as  the  forecasting  parameters.  Then, 
the  forecasting  wind  information  is  converted  to  the  wind  power 
production.  Or  the  forecaster  can  forecast  the  wind  power  produc¬ 
tion  directly.  Furthermore,  the  forecaster  can  choose  regional  or 
probabilistic  forecasting.  Choosing  the  appropriate  forecasting 
parameters  can  improve  the  performance  [149], 


5.  Conclusion  and  future  works 

5.1.  Discussion  and  conclusion 

Today's  power  system  is  facing  growing  challenges  in  main¬ 
taining  a  secure  and  reliable  energy  supply.  Part  of  the  growing 
challenge  is  the  possibility  of  significant  levels  of  uncertain  wind 
generation  being  installed  on  power  system.  This  brings  new 
challenges  involving  the  management  of  wind  intermittence. 
Accurate  forecasting  of  wind  power  is  necessary  to  meeting  these 
challenges. 

The  main  objectives  of  wind  speed  and  power  forecasting  are 
to  estimate  the  wind  speed  and  power  as  quickly  and  accurately  as 
possible.  With  more  accurate  forecasting  integration  of  large 
amounts  of  wind  power  into  power  system  can  be  achieved. 
Accurate  forecasting  tools  reduce  the  financial  risk  and  lead  to 
improved  scheduling  and  unit  commitment  plans. 

This  paper  provides  an  overview  of  forecasting  tools  that  are 
available  for  wind  generation.  The  wind  speed  and  power  fore¬ 
casting  approaches  reviewed  have  different  characteristics,  with 
different  approaches  providing  better  results  for  different  forecast 
time  horizons  and  locations.  This  paper  divides  forecasting  meth¬ 
ods  into  five  categories:  wind  speed  and  power  forecasting,  spatial 
correlation  forecasting,  regional  forecasting,  probabilistic  forecast¬ 
ing,  and  offshore  forecasting.  An  overview  of  existing  research  in 
each  category  is  presented. 

Several  wind  speed  and  power  forecasting  approaches,  includ¬ 
ing  physical,  statistical,  and  combination  approaches,  have  been 
implemented  and  are  now  commercially  available.  The  statistical 
approaches  provide  good  results  in  the  majority  of  cases,  including 
short-term,  medium-term,  and  long-term  forecasting.  However,  in 
the  very  short-term  and  short-term  horizon,  the  influence  of 
atmospheric  dynamics  becomes  more  important,  so  that  the  use 
of  the  physical  approaches  becomes  necessary. 

The  statistical  forecasting  approaches,  such  as  conventional 
statistical  approaches,  ANN,  and  ANN-Fuzzy,  use  large  amounts  of 
historical  data  as  input.  The  conventional  statistical  approaches  are 
mostly  aimed  at  very  short-term  and  short-term  forecasting.  ANN 
is  able  to  model  complex  non-linear  relationships  and  extract  the 
dependence  between  the  inputs  and  outputs  through  learning 
and  training.  Fuzzy  logic  outperforms  others  when  dealing 
with  reasoning  problems,  while  learning  and  training  abilities 
are  mediocre.  The  merging  of  the  ANN  approach  with  the  Fuzzy 
logic,  creating  the  ANN-Fuzzy  approach,  provides  excellent  perfor¬ 
mance. 

It  is  difficult  to  say  which  model  is  the  best  because  different 
models  perform  differently  in  different  situations.  The  combina¬ 
tion  approach  attempts  to  leverage  the  strengths  of  different 
modeling  approaches.  This  approach  is  able  to  not  only  improve 
the  forecast  accuracy,  but  also  reduces  the  risk  from  extreme 
events. 

In  order  to  select  a  suitable  location  of  wind  generation,  it  is 
necessary  to  assess  the  wind  power  sufficiency  of  the  neighboring 
regions.  In  this  case,  spatial  correlation  forecasting  may  be  applied 
to  estimate  the  characteristics  of  the  wind  resource  at  sites  where 
insufficient  information  is  not  available.  Current  models  use 
functions  that  describe  the  wind  flow  depending  on  the  distance 
between  two  sites  by  selectively  using  the  topography  of  the  area, 


the  height  of  the  measuring  mast,  the  cross  correlation  factor,  or 
other  relative  meteorological  data.  These  models  are  more  difficult 
than  typical  time-ahead  forecasting  models  because  they  require 
measurements  from  many  spatially  correlated  sites  and  the 
associated  measurement  delay  times. 

Regional  forecasting  has  become  more  important  for  market 
operators  to  estimate  the  total  wind  power  production  in  a  region. 
Furthermore,  it  has  been  shown  that  regional  forecasts  provide 
faster  and  more  accurate  results  than  single  farm  forecasts  because 
of  the  spatial  smoothing  effects.  However,  few  studies  have  looked 
into  the  particularities  of  regional  forecasting.  When  wind  energy 
penetration  increases  significantly  solutions  to  this  problem  will 
be  needed. 

Probabilistic  forecasting  gives  an  estimate  of  where  the  wind 
production  will  lie  within  a  certain  degree  of  confidence.  In  this 
review  some  of  the  representative  methods  are  considered, 
including  the  parametric  approach,  non-parametric  approach, 
and  ensemble  forecast.  There  are  not  many  parametric  approaches 
available  due  to  the  complex  stochastic  nature  of  wind  speed. 
Thus,  non-parametric  approaches,  and  distribution-free 
approaches,  have  been  more  appealing.  In  addition,  ensemble 
forecasting  models  for  probabilistic  forecasting  are  used  in  order 
to  obtain  the  expected  spread  of  weather  conditions  and  assess  the 
probability  of  particular  weather  events. 

In  future,  major  wind  power  developments  are  expected  to  be 
installed  offshore.  More  frequent  and  powerful  wind  speeds 
represent  the  main  advantages  of  offshore  power  production. 
However,  offshore  wind  forecasting  is  in  an  early  phase  of 
development.  All  state-of-the-art  forecasting  models  were  origin¬ 
ally  designed  for  onshore.  For  physical  forecasting  approaches 
special  modeling  considerations  have  to  be  considered  for  off¬ 
shore.  However,  with  statistical  approaches  precise  knowledge  of 
offshore  conditions  is  not  needed. 

Using  advanced  wind  speed  and  power  forecasting  approaches 
introduced  in  Section  2,  it  is  possible  to  provide  significantly  better 
forecasts  than  the  simplistic  or  reference  forecasting  approaches. 
However,  it  is  difficult  to  say  which  model  is  the  best  because  wind 
speed  and  power  forecasting  is  not  “plug-in-play"  due  to  the  site- 
dependence.  Therefore,  an  evaluation  of  accuracy  relative  to  the 
forecasting  model  performance  is  discussed.  The  general  forecast¬ 
ing  accuracy  is  then  considered  along  with  the  level  of  forecasting 
errors  that  can  be  expected  depending  on  the  forecast  time 
horizon,  location  complexity,  seasonal  variability,  and  weather 
conditions.  The  forecasting  accuracy  goes  down  with  increasing 
forecast  time  horizon,  terrain  complexity,  and  unstable  weather 
regimes. 

Finally,  several  potential  techniques  for  improving  the  accuracy 
of  the  models  are  discussed:  (1)  Kalman  filtering  of  systematic 
errors  coming  from  NWPs.  (2)  The  combination  of  different  NWPs 
or  forecasting  models.  (3)  Combining  power  outputs  of  distributed 
wind  farms  providing  spatial  smoothing  effects.  (4)  Estimating 
the  wind  speed  at  wind  farm  locations  by  statistical  down 
scaling.  (5)  Appropriate  selection  of  input  parameters.  (6) 
Improved  power  curve  modeling.  (7)  Choosing  the  appropriate 
forecasting  parameters. 

5.2.  Future  work 

The  reviewed  wind  speed  and  power  forecasting  approaches  in 
this  paper  provide  a  foundation  for  future  research.  Future  work  to 
be  considered  includes  the  following: 

•  Further  improvements  in  both  physical  and  statistical  forecast¬ 
ing  approaches  and  improved  approaches  for  the  combination 
of  different  forecasting  approaches  can  be  expected  to  further 
reduce  forecasting  errors. 
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•  Extending  the  forecasts  further  into  the  future,  enabling  long¬ 
term  planning. 

•  Regional  forecasting  is  useful  in  feasibility  studies  of  prospec¬ 
tive  installations  and  assessment  of  the  potential  impact  on 
system  reliability.  The  choice  of  reference  sites  and  their 
combinations  should  be  considered  in  regional  forecasting. 

•  Forecasting  the  aggregated  wind  power  on  a  regional  basis 
would  be  useful  for  management  of  total  wind  power  produc¬ 
tion.  It  would  also  be  useful  to  detect  extreme  events  so  that 
market  operators  can  manage  trading  as  wind  power  ramps  up 
and  down. 

•  Probabilistic  forecasting  could  provide  probability  distributions 
of  the  forecasting  errors,  providing  the  opportunity  to  reduce 
the  reserve  capacity  for  balancing  wind  power  forecast  errors. 

•  The  development  of  operational  ensemble  forecasts  using  the 
data  from  several  NWPs  would  help  improve  the  forecast 
accuracy. 

•  Improved  offshore  forecasting  models  are  needed  to  improve 
the  offshore  forecasts  that  are  currently  being  done  with 
models  developed  for  onshore. 

•  New  approaches  for  complex  terrains  are  required.  These 
approaches  could  incorporate  higher  resolution  maps. 

•  Improving  and  tuning  the  NWP  model  to  predict  the  hub- 
height  wind  would  help  improve  the  forecasting  accuracy.  It  is 
sometimes  required  to  forecast  accurate  pressure,  temperature, 
and  other  related  weather  parameters. 

•  Additional  use  of  online  wind  measurement  data  has  the 
potential  for  improved  forecasts,  especially  for  very  short¬ 
term  and  short-term  wind  power  forecasting. 

•  Further  research  on  automatic  adaptive  parameter  estimation 
is  required  since  wind  speed  and  power  forecasting  is  very  site- 
dependent. 

•  Increasing  the  spatial  and  time  resolution  of  NWPs,  better 
taking  into  account  local  phenomena,  would  help  to  improve 
the  forecast. 

•  Forecasting  extreme  events,  as  opposed  to  common  behavior, 
would  be  very  valuable  to  protecting  the  power  system. 
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